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Abstract — In this paper, we define a simple discrete mathemati- 
cal model for wireless ad-hoc sensor networks and study the prob- 
lems of data distribution and data collection which arise in those 
networks. We show how those tasks can be optimally performed 
on tree sensor networks that use directional antennas. Further- 
more we compare the performance of a network equipped with 
directional antenna elements with one equipped with omnidirec- 
tional antenna elements and show the former outperforms the lat- 
ter by 50% at most on a line network. 



I. Introduction 

The advancement of Very Large Scale Integration (VLSI) 
technology has contributed much to the development of micro 
sensor systems. Such systems can combine signal processing, 
data storage, wireless communication capabilities and energy 
sources on a single chip. Possibly distributed over a wide area, 
networks of such devices can autonomously perform various 
sensing tasks such as environmental (seismic, meteorological) 
monitoring and military surveillance [1]. These networks are 
referred to as wireless ad-hoc sensor networks or simply sensor 
networks. In sensor networks while each node may be mobile, 
it is typically the case that once the target site of their sensing 
application is reached a semi permanent stationary configura- 
tion is adopted for the purpose of gathering information. 

In the area of general ad-hoc networks as well as sensor 
webs, research has focused on routing [2], medium access con- 
trol (MAC) [3] [4] and physical layer [5]. [6] and [7] are pro- 
tocol suites specifically designed for sensor webs. Theoretical 
results regarding capacity of general static ad-hoc networks first 
appeared in [8]. Also relevant to our research is the so called 
packet routing problem which consists in moving packets of 
data from one location to another as quickly as possible in a 
network and has been extensively studied in conjunction with 
wireline network models [9]. To the best of our knowledge no 
results specific to sensor networks had yet been derived. 

We imagine a sensor network as having two main phases of 
operation (in steady state, after the nodes have organized them- 
selves into a network). In a first phase or measuring phase, 
area monitoring results in an accumulation of data at each sen- 
sor, in the second phase or data transfer, the collected data is 
transmitted to some processing center located within the sensor 
network. In this paper we investigate the efficiency limits of 
such data transfers. 

This work was performed at Caltech and supported by The Lee Center for 
Networking. 



This paper is organized as follows: In section II we give a 
possible model for a sensor network. We considered two types 
of networks, one where nodes are equipped with directional an- 
tennas and one where nodes are equipped with omnidirectional 
antennas. We present our results for the former in section III 
and briefly present our results regarding the latter in section IV. 
We present a comparison analysis of the two systems in section 
V and conclude in section VI. 

II. Model and problem statement 

We define a sensor network as a finite collection of n identi- 
cal sensor nodes {N\, N n }. Each node Ni is associated with 
an integer pi that represents the number of data packets stored 
at this node at the end of the measuring phase. There is one 
special node denoted iV 0 , the processing center, which we will 
refer to as the base station (BS). All the nodes including the BS 
have a common transmission range r. A node (BS included) 
cannot receive and transmit at the same time. The interference 
model as defined in [8] is adopted here. That is, a transmission 
from node Ni to node Nj where i,j > 0 is successful if for 
every other node N k ,k>0 simultaneously transmitting: 

\N { -Nj\< r, \N k - Nj\ > (1 + S)r, 6>0 (1) 

We assume in our model that time is slotted and a one hop trans- 
mission consumes one time slot (TS). The network is further as- 
sumed to be synchronous. A node can only transmit/receive one 
data packet per TS. Multiple transmissions may occur within 
the network in one TS under this interference model by virtue 
of spatial separation. Our network may be represented as a 
weighted rooted graph {V,E,p} where V = {N 0 ,...,N n } y 
E denotes the set of links and p = [jp u ...,p n ). In this graph 
model the root represents the BS (N 0 ) and an edge represents an 
existing wireless connection (a link) between two sensor nodes, 
or a sensor node and the BS (a necessary condition for that con- 
nection to be present is that the distance between two nodes is 
less or equal than the transmission range r). By its nature this 
link is single duplex bidirectional. Our goal is to route the data 
contained at each node to the BS as efficiently as possible. We 
refer to this as the data collection problem. 

III. Directional antenna systems 

A. Line Networks 

In this subsection we consider a line network (an example of 
which is given in Fig. 1) of sensor nodes. A BS is placed at 
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one end of the network. We assume sensor nodes are regularly 
placed along the network. We denote by d the distance be- 
tween any 2 nodes. Assume each node is equipped with direc- 
tional antennas allowing transmissions over a distance r where 
d < r < 2d. Further assume that S is such that (l+5)r < 2d. It 
is straightforward to extend the following results to more gen- 
eral line networks where nodes are randomly placed along a 
line and to different values of r, 5, as long as end to end con- 
nectivity of the network is ensured. Let's denote node N { by 
its distance to the BS in number of hops, that is i. We denote 
i — > i 4- 1 a transmission from node i to node i + 1. Our goal is 
to determine the minimal duration of the transfer phase and an 
associated optimal communication strategy (Note that in gen- 
eral such a strategy is not unique). 

For purpose of solving this problem we look initially at the fol- 
lowing converse problem (which we shall subsequently refer to 
as the distribution problem); instead of nodes sending their re- 
spective packets to the BS, assume the BS is to transmit data 
packets to nodes. The data transfer efficiency remains our con- 
cern. This problem is of separate interest in sensor networks. 
We propose the following simple greedy algorithm for solving 
the distribution problem. We shall prove subsequently it is opti- 
mal. The BS is to send first data packets destined for the furthest 
node, then data packets for the second furthest one and so on, as 
fast as possible while respecting the channel reuse constraints. 
Nodes between the BS and its destinations are required to for- 
ward packets as soon as they arrive (that is in the TS following 
their arrival). Following is algorithm 1 running at the BS. 

Given a line network (represented by the vector Network = 
p), it dictates the BS actions at each time step: remain idle 
(action = 0) or transmit (action = 1). The result is stored in 
the vector action. When an action is chosen the right packet 
is to be handed over to the BS for transmission. One might 
assume that there is a stack of data packets correctly ordered 
with respect to the distance to the BS and that that stack is being 
updated after each BS action so that a packet is popped off the 
stack as it is transmitted. 

Algorithm 1 Determines BS actions 

input: Network 
output: action 

1: step <- 1, legal <- 1, packtsJeft <- £VNetwork(i) 

2: while packtsJeft ^ 0 do 

3: if legal then 

4: action(step) <— 1 

5: packtsJeft <— packtsJeft- 1 

6: legal «- 0 

7: else 

8: action(step) <— 0 
9: legal 1 
10: end if 

11: if packtsJeft < Network( 1 ) then 

12: legal <- 1 

13: end if 

14: step«-step+l 

15: end while 

The procedure is illustrated in the example of Fig. 1 where 
V = {0,1,2,3,4,5,6,7}, £ = + 1),0 < i < 6}, 
p = (2, 0, 0, 0, 3, 0, 1), d < r < 2d, (1 + S)r < 2d. The sched- 
ule of transmissions as determined by algorithm 1 is drawn be- 
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Fig. 1. 8-node line network (pi = 2, p2 = p3 = p4 = p& = 0, ps = 
3, P7 = 1) followed by optimal transmission schedules for the distribution 
(upper schedule) and collection (lower schedule) problems. They are symmet- 
ric of one another. The job is performed in 1 1 TS 



low the network for the distribution and collection problems 
respectively. Either way it is performed in 1 1 TS. 
Next we determine the performance of our algorithm in gen- 
eral. Denote T{ the last busy TS at node i in the execution of 
our distribution algorithm (In the previous example, we have 
Ti = 10,T 2 = 9,T 3 = 10,T 4 = 11,T 5 = ll,r 6 = 7,T 7 = 7). 
Clearly then our algorithm runs in max {Ti}. Ti is a function 

l<*<n 

of the distance to the BS, the number of packets destined for 
node i as well as the number of packets forwarded by node i. 
Lemma 1: Assuming pi = 0 for i > n, we have: 

fi-l + 2Ei> i+ iPi ifft = 0 
Ti = I Pi +2^2^ if i = 1 and Pl > 1 (2) 

[i-2 + 2J2j>iPj if i > 2 and ^ > 1 

Proof: 

Vz > 1, node i is idle the first i — 1 TS. It forwards 2j>»+i Pj 
data packets to further nodes and receives p, data packets that 
are destined for itself. Forwarding a data packet consists in 
receiving that data packet and transmitting it right away and 
therefore a node involved in forwarding one data packet will 
remain busy two consecutive TS. Receiving a data packet on 
the other hand consumes only one TS but in our scheme forces 
node i > 2 to remain silent in the following TS. Therefore, 

Pi >l=^7\ = 2^p J +p 1 

Pi > 1, i > 1 Ti = (i - 1) + 2 £ p,- + 2(p< - 1) + 1 
Pi = 0 =* Ti = (i - 1) + 2 £ Pj 

j>x+l 

■ 

Clearly the maximum of Ti is obtained over the set {i > 
1 1 Pi ^ 0}- We define, for a given sensor network, T u (p) the 
minimum length of a time schedule over all time schedules that 
perform the distribution job. In particular we have: 

Lemma 2: 

T»(p) < max Ti (3) 
{<>i|w*o> 
Next we derive a lower bound on T u (p). 



Lemma 3: Assuming pi = 0 for i > n, we have: 

T tt (p) > max (j - 1 + ft + 2 £ ft) ( 4 ) 

Kt<n 

Proo/: Node i has to forward Pt data packets to 

further nodes. Forwarding one data packet consists in receiving 
and transmitting that data packet and therefore results in a two 
TS consumption (per forwarded packet) . Besides it is itself the 
destination of pi data packets. Each received data packet costs 
at least one TS. Furthermore node i can't be active before it 
receives a data packet which takes at least i — 1 TS. Therefore 
Si = 2 S,-> i+1 Pj + Pi + (i — 1) is a lower bound on any time 
schedule for all i. Hence max 5* is a lower bound on T u (p). 

l<t<n 

■ 

Finally we prove that the lower bound on T u (p) derived in 
lemma 3 equals the upperbound derived in lemma 2 and hence 
that the proposed schedule is optimal. 

Theorem 4: Assuming pi — 0 for i > n, we have: 

T u (p) = max (i - 1 + Pi + 2 ]T Pj ) (5) 

l<i<n * 

i>t+l 

/Voo/; Clearly Si = I\ and S< = Vi > 2 ifpi < 1. 
If pi > 1 then > S*but either p^ x = 0 then S<-i = 7* or 
> lthen5i-i > T<. ■ 
Corollary 5: In the particular case where no two consecutive 
components of vector p equal zero, Theorem 4 reduces to: 



T u (p)=pi+2j2pi 



(6) 



i>2 



We now return to the data collection problem. The construction 
of a schedule here is based on the symmetry of the operations of 
distribution and collection. A time schedule that is symmetric 
to the distribution problem's schedule with respect to a Active 
horizontal axis (see example in Fig. 1) provides us with an opti- 
mal solution, the time to transmit data packets from nodes to the 
BS being indeed the same as the time to carry out the converse 
operation (and being therefore minimal). In particular a trans- 
mission i -> i + 1 occurring at TS j in the distribution problem 
is a transmission i + 1 -> i occurring at TS T u (p) + 1 — j in 
the collection problem. Since the solution to one problem gives 
us the solution to the other, we only consider the distribution 
problem in the sequel. Note that an additional issue is raised in 
the data collection case; indeed the described algorithms don't 
require the network to be synchronous in the distribution case 
(so the algorithms may be run in a distributed way) whereas 
they do in the data collection case. 

B, 2-b ranch networks 

Consider now a line network and place the BS anywhere 
on that line. Another way to look at this problem is to con- 
sider it as a two branch line network (the two branches being 
respectively represented by the data vectors p and q) where 
a one branch line network is the previously considered case. 
We denote T u (p,q) the optimal performance achievable on a 
2-branch network. A general algorithm to distribute data on 
multibranch networks is proposed in the next section. Our pro- 
cedure is illustrated in the following example (Fig. 2) in the 
case where only two branches are present. It is subsequently 



proved to be optimal. The BS transmits in the direction that 
requires most work (as determined by equation (5)) first and 
then alternates transmissions between the two branches. Fur- 
thest nodes are served first then second furthest and so on. A 
data packet is automatically forwarded in the slot that followed 
its reception by a non destination node. 
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Fig. 2. Optimal distribution schedule in a 2-branch sensor network. The job is 
performed in 13 TS 



Theorem 6: Without loss of generality we may assume 
r„(p) >r„(q),then: 



r„(p,q)= { 



Proof: 



'max(T u (p) + 1, E<>i Pi + Qi) 

ifT u (p)=r u (q) 

max(T u (p),J2i>iPi + Qi) 

ifT u (p)>T u (q) 



(7) 



r„(p) = T u (q) => T u (p,q) > max(T n (p) + l^Pi +(/<)) 



T u (p) > r tt (q) lifoq) > max(T u (p), + Qi )) 

It is easy to see why the above described algorithm achieves this 
lower bound (consider for example the case T u (p) = T u (q)) . 
Indeed either the algorithm takes T(p) + 1 TS to perform the 
job or it takes TL X (resp. T{) defined as the last busy TS at 
distance 1 to the left (resp. to the right) from the BS. If it so 
T!_ x (resp. T{) equals £»>i Pi + qi ). ■ 

C. Multibranch networks 

The algorithm running at the BS determines at each TS to- 
ward which branch transmit, if transmission is possible at all. 
The direction of transmission is greedily decided, based on es- 
timates (one per branch) of the completion time of the data 
transfer. Initial estimate for a given branch is determined by 
equation (5). The legal direction associated with the biggest es- 
timate is chosen (a legal transmission is one that respects the 
channel reuse constraints, so for example it is not legal for our 
algorithm to transmit in two successive TS toward a given node 
located at distance greater than 2 from the BS), ties being bro- 
ken randomly. When no legal direction exists the BS remains 
idle. After a decision has been made (transmit toward a par- 
ticular direction or stay idle) the estimates at each branch are 
updated according to the following rule: if a legal direction was 
not chosen, its new estimate is its old estimate plus one. Illegal 
direction estimates remain unchanged. The idea is to minimize 
at each TS the overall estimate of the transmission time. 
Next we illustrate the procedure on an example (Fig. 3). In the 
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accompanying table, we list data transfer completion time esti- 
mates at each TS and the corresponding decision made by the 
BS. As previously stated the initial completion time estimates 
are computed using equation (5). The table reads as follows. TS 
1: All 4 transmission directions are legal. The BS chooses to 
transmit toward branch A. At TS 2, transmitting toward A is not 
a legal move, the legal transmission direction associated with 
the biggest estimate is B, etc. Along a given branch, the pack- 
ets destined for furthest nodes are sent first by the BS. As for 
the other nodes they merely forward the data packets of which 
they are not recipients (a packet is transmitted in the following 
TS that it was received). In this example the algorithm performs 
in 10 TS (an obvious lower bound is 8 TS corresponding to 8 
data packets). It turns out 10 TS is indeed a lower bound on the 
execution time of any schedule. Here we omit the general case 
proof. It may be found in [10]. 



(A) 



2 * 



(D) 



TS A B C D BS 

1 9 7 5 3 A 

2 9 8 6 4 B 

3 9 8 7 5 A 

4 9 8 8 6 B 

5 9 8 9 7 A 

6 8 10 8 C 

7 9 9 B 

8 10 D 
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Fig. 3. Optimal distribution schedule for BS on a 4-branch sensor network. 
The completion time is 10 TS. 



D. Tree networks, case where degree of base station is 1 

Throughout this paragraph we assume that the degree of the 
root of the considered graphs is one. 

Definition 7: We define the equivalent linear network 
(Gi , Ei , Pl ) of a network (G, E, p) . If G = { N 0 , N x , . . . , N n } 
and p = (pi, . . . ,p n ) then Gi = {0, 1, . . . ,m}, Ei = {(i - 
l,i),l < i < ra] and pi = (pa, . . . , pi m ) where m = 
mp(d(JV 0 ,JVi)) andpij = E<|d{jVo f Ar,)=iJ* 
We illustrate a tree network in Fig. 4 (n = 14, m = 7); its 
equivalent linear network is shown in Fig. 1. 
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Fig. 4. A 15-node tree network with degree of BS=1, the equivalent linear 
network is drawn in Fig. 1 . Transmission TS are written next to the edges. 

The equivalent linear network's schedule may serve as a 
schedule for the initial tree network. Next we explain how 
transmission time slots for (Gi,Ei,pi) (determined by run- 
ning algorithm 1) may be mapped onto (G, E,p). Consider 
an element in E, say (Ni 0 ,Nj 0 ), such that d(N 0 ,N io ) = 
a (hops). Based on the number of data packets Nj 0 has 
to forward, say / J0 , we shall allocate transmission time 
slots to edge (N io ,N jo ). Define E a = {(N u Nj) 6 



E\d(N 0 ,Ni) = a}. Each packet P follows a path path(P) 
from the BS to its destination node where path(P) de- 
notes the finite sequence of edges (ei,...,e*) traversed in 
that order by P. For convenience we shall write path(P) 
as the sequence of vertices (vertices(ei), vertices(ek))> 
We define <p a = {P|3e eE a H path(P)}. We de- 
fine T a = {TS used by (a, a + 1) € E t }. We have: 
Wa\ = E (Pj + fj) = £ m =|T tt |. Thus one may 

(NuNrfeEa k>a 

define a one to one correspondence g between ?p a and T a 
that associates the packet P with the longest path in ^3 a , with 
the TS with the smallest index in T tt ; the packet P with 
second longest path, with the TS with second smallest index 

and so on. We finally define <p£*°.*io) ={p| (N io ,N jo ) e 
path(P)}C <p a . (N ioy Nj Q ) is associated with time slots 

gVP { a Nio ' Njo) ). In the example of Fig. 4, we have: {P} = 
{Pi,P 2> - • • ,Pe} where the first packet is characterized by 
path(Pi) = (iVb,JVi, N 2i N^,N 7 ,N s ,Ng ) Ni 0 ) y the second one 
by path(P 2 ) = (^0,^1,^2,^3,^4,^5), the third one by 
path(P 3 ) = (N 0 ,N u N 6 ,Ni 3 ,N 12 ,N n ), the fourth one by 
path(P 4 ) = (No,N u N 2y N 3 ,N 7 ,N 14 ), and finally the fifth 
and sixth ones by path(P^) = path(P&) =(N 0i Ni). We 
also have E x = {(N U N 2 ),(N U N 6 )}^ X = {P U P 2 ,P 3 ,P 4 }, 
Ti = {2,4,6,8}, and y[ N *> N ^ = {P U P 2 ,P 4 }. Thus edge 

(N U N 2 ) is associated with time slots g(^[ NuN2) ) ={2,4,6} 
Thus algorithm 1 run on the equivalent linear network provides 
a BS transmission schedule. Intermediate nodes simply forward 
data packets to further nodes as they arrive (in the TS following 
their arrival). This requires a routing table at junction nodes. 

Although an equivalent linear network has a reduced set of 
possible concurrent transmissions, this procedure produces an 
optimal transmission schedule. The proof omitted here because 
of space limitations is based on the fact that transmissions that 
can occur in one case and not in the other, that is 1) multiple 
transmissions from nodes at distance i (from BS) to nodes at 
distance i + 1 and 2) simultaneous transmissions from node(s) 
at distance i to node(s) at distance i + 1 on one hand and from 
node(s) at distance i — 1 to node(s) at distance i on the other 
hand are not helpful in routing data faster. This is due to the 
fact that any path from the BS to a node necessarily includes 
link (0, 1) which constitutes a bottleneck. 

E. Tree sensor networks, general case 

The results in the previous sections suggest the following al- 
gorithm for dealing with general tree networks. 

1) linearize the subtrees attached to the BS according to the 
procedure described in III.D 

2) apply multibranch algorithm described in section III.C to 
the resulting multibranch system 

By combining results obtained in IELA, C and D, the proce- 
dure is clearly optimal for general tree networks. 

IV. Omnidirectional antenna systems 

Line networks 

Our results readily extend to omnidirectional antenna sys- 
tems. We have, for a one branch line network, using 



an algorithm similar to algorithm 1 and keeping the nota- 
tions/assumptions introduced in section III: 



if p\ = 0, P2 — 0 and 

Ty = < Ei> 3 Pi > i (8) 

Pi + 2p2 + 3 Sj>3 Pi otherwise 



T 2 = 2p2 + 3^pj 

J>3 



(9) 



Vi>3 



-2 + 3E j>4 Pi 
i-3 + SE^iPi 



if p< = Oand > 1 

if p< = 1 

if Pi > 2 



(10) 



Theorem 8: 



T 0 (p) = max {i - 1 + Pi + 2p i+l + 3 Y] Pj } (11) 

i>i+i 

Omnidirectional antennas generate more channel reuse con- 
straints and as a result require longer time schedules. For exam- 
ple if the network of Fig. 1 was equipped with omnidirectional 
antennas, the minimum schedule length would be 14 TS (this 
takes into account the fact that transmissions from node i to 
node i + 1 creates interferences at node i — 1 Vz > 1). This is 
a 27% increase over the directional antenna system. Bounds on 
the relative performance of the two systems are derived in the 
next section. 

V. COMPARISON BETWEEN OMNIDIRECTIONAL AND 

DIRECTIONAL SYSTEMS 

In order to get a better intuition on how the two systems per- 
form relative to one another, we give the following comparative 
result for a line network: 

Theorem 9: 

1 < To(p) <r 1 1 

1 -7UpT <L5 
Proof: observe that: 

• If the maximum of omnidirectional/directional Ti occurs 
in j + 2 then the maximum of omnidirectional Si occurs 
\nj whereas the maximum of directional Si occurs in 
Hence by equations (5) and (1 1), we get: 

T 0 (p) = J - 1 + Pj + 2phi + 3 Ej> i+2 Pi 

2_^>j+2 Pi 



Hence, 



Tote) 



> 1.5 0.5j + 1 < pj + 0.5pj+i = 0 



T u (p) 

But this is not possible since j > 0. 
If the maximum of omnidirectional/directional Ti oc- 
curs in j € {1,2} then the maximum of omnidirec- 
tional/directional Si occurs in j = 1. Therefore by equa- 
tions (5) and (11), we have: 

T Q (p) = Pi + 2j?2 + 3 J2j>3 Pj 
T u (p) Pi + 2p2 + 2£ i > 3 Pj 



Hence, > 1.5 =» p2 + 0.5pi < 0 

T u[P) 

Again this is impossible. Hence the right half of the in- 
equality. 

■ 

To see these bounds are tight, we look at the two following 
(typical) scenarios (n > 3): 

1) p = (!,...,!) By equations (6) and (11), = 



2n-l 



n-foo 



+ 1.5 



2) p = (0,...,0,j? n ) By equations (5) and (11), = 
^§^ifPn>l;gg = lifPn = l. 

VI. Conclusions and future work 

We have proposed optimal strategies to distribute and collect 
data packets from a tree-like sensor network when the distribut- 
ing/collecting node has degree 1 or 2. The exact performance 
times of such strategies have been derived. We have generalized 
those strategies to general tree networks. Finally we compared 
the performance of omnidirectional systems to directional ones. 
We are currently working on extending our comparison analysis 
between directional and omnidirectional systems. Furthermore 
we are looking at the impact of network cycles on the optimality 
of our algorithms. 
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